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Abstract. A simple approach is presented to study the asymptotic behavior 
of some algorithms with an underlying tree structure. It is shown that some 
asymptotic oscillating behaviors can be precisely analyzed without resorting 
to complex analysis techniques as it is usually done in this context. A new 
explicit representation of periodic functions involved is obtained at the same 
time. 



Contents 



Introduction! 1 



2. A Binary Splitting algorithm! 3 

3. Ethernet" 8 



4. Extensions! 11 



5. Appendix on Poisson Processes! 13 
References! 14 



1. Introduction 

Algorithms with an underlying tree structure are quite common in computer sci- 
ence, they are used to sort, store and search. . . See Cormen et al. and Knuth [T5] . 
Splitting algorithms are examples of such algorithms, they can be described as fol- 
lows 

Splitting Algorithm S(n) 

— Termination Condition. 

For some subset F of N, if n € F 
— ► Stop. 

— Tree Structure. 

Randomly divide n into n\,. . ., rid, with ni + • • • + n<j = n. 
— > Apply S(m), S{n 2 ), S{n d ). 

To the algorithm S(n) is attached a cost R n which can be the number of steps 
required to terminate for example. This cost is assumed to be additive that is, with 
the above notations, the relation 

(1) Rn = 1 + R ni + Rn 2 + ' ' ' + Rn d , 

holds. Note that R n is a random variable since the algorithm divides randomly into 
subgroups. This quantity can be thought as the cost of processing n items. If E(i?„) 
is its expected value, E(i?„)/n is the average processing time of one item among 
n. From a probabilistic point of view, it is natural to expect that the sequence 
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(Rn) satisfies a kind of law of large numbers, i.e. that (E(R n )/n) converges to 
some quantity a. The constant a would be, in some sense, the asymptotic average 
processing time of an item. In the context of a communication network, R n is 
the total transmission time of n initial messages, l/a would be the asymptotic 
throughput of the protocol. 

Curiously, this law of large numbers does not always hold. In some situations, the 
sequence (E(R n )/n) does not converge at all and, moreover, exhibits an oscillating 
behavior. Up to now, these phenomena have been mostly analyzed by using sophis- 
ticated complex analysis techniques (via functional transforms) by Knuth |15|. Fla- 
jolet et al. 9 and many others. See Mahmoud |17| for a comprehensive treatment 
of this approach. For alternative methods using real analysis on related problems, 
see the nice paper by Delange |H] and also Pippenger |2()j . 

In this paper, a direct, simple, approach is proposed to study these uncommon 
laws of large numbers. At the same time, it sheds a new light on the oscillating 
phenomena involved. First, the classical approach, i.e. with complex analysis, is 
briefly recalled. 

Analytic approach: An excursion inside the complex plane. A classical 
method to derive asymptotics of a sequence (r n ) related to a splitting algorithm 
consists in taking successive transforms: 

/<->\ / \ Poisson / n xTL -x Mellin », , f + °° , % s _i , 

2) (r n ) — > r(x) = } r n —e x — ► r*(s)= / r (x)x s dx. 

n>0 U 

The Poisson transform step may be, sometimes, skipped: 

(3) (r w ) M -^ n r-(.) = 5> * 

n>l 

In this way, Equation JQ) can be translated, via a Poisson transform, into a func- 
tional equation of the form 

(4) r(x) = r(j3x) + h(x), x>0, 

where < (3 < 1 and h is some fixed function. Provided that an iteration scheme 
is valid, the problem is then to get an asymptotic expansion of the series 

r(x) = r(0) + h({3 n x) 

n>0 

as x goes to infinity. This is usually done by taking the Mellin transform of the 
function x — * r(x). 

Mellin Transform. The Mellin transform r* is defined in a vertical strip of C and, 
under some growth conditions, the asymptotic behavior of r n [resp. r(x)], as n 
[resp. x] gets large, can be expressed by using the poles of r* on the right of the 
strip (provided that some growth conditions of the Mellin transform are satisfied) . 
See Flajolet et al. UJ- When the Poisson transform is used, the next step is to 
justify that the behavior of r(x) as x gets large is similar to the behavior of r n for 
n large. 
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Poisson Transform. If (iV([0, u]), u > 0) is a Poisson process with parameter 1 (see 
the Appendix for a quick presentation): for u > 0, the variable N([0,x]) has a 
Poisson distribution with parameter x. The function x — > r(a;) can therefore be 
expressed as 

r(x) = E (rjv([o )X ])) , 

for x > 0. The law of large numbers for Poisson processes states that, almost surely, 
N([0,x])/x ~ 1 as i gets large, this suggests that rjvrro.al) ~ f\ x \ P rov ided that 
the sequence (r„) does not vary too much: Recall that, due to the central limit 
theorem, the approximation N([0, x]) ~ x is valid with an error of the order of y/x. 
Analytically, some conditions on the sequence (r„) can be formulated so that an 
equivalence between the asymptotic behaviors of the sequence and of the Poisson 
transform can be established. See Jacquet and Szpankowski ^2] f° r example. 

A direct method. The approach presented here relies on a convenient use of 
Fubini's Theorem combined, sometimes, with some elementary properties of Poisson 
processes. The purpose of the paper is to show that some asymptotic results for 
algorithms on trees can be obtained in an elementary way. To show the effectiveness 
of the method, expansions with oscillating behaviors, which are usually analyzed 
with quite technical results of complex analysis, are obtained with this approach. 

With the analytical approach, oscillating expansions are described with a pe- 
riodic function which shows up through its Fourier coefficients. It occurs when 
the Mellin transform of the sequence has poles on an imaginary axis at the points 
(a + inb, n € Z). The method presented here has the advantage of being more direct 
and to give an explicit expression of the mysterious periodic function which, in the 
end, is not mysterious at all. 

Rather than setting up a framework with formal results, the presentation of some 
important and interesting algorithms already analyzed in the literature has been 
chosen. Section studies one of the first algorithms for which a curious oscillating 
behavior has been proved (by Knuth) . Section [3| considers the basic algorithm of 
the Ethernet protocol. It is not, strictly speaking, a law of large numbers setting 
but a similar oscillating behavior occurs also in this case. Moreover, it is not only 
true for the averages but also for the distribution of the variables. Section 0] gives 
other examples where similar methods can be used. In particular, a treatment of 
Equation Q is proposed. The examples of Section and Section 0] show also that 
the method does not give only a first order term of the asymptotic expansion (i.e. 
at the level of the law of large numbers) but can also give subsequent terms of the 
expansion. A more complicated splitting algorithm is analyzed in Mohamed and 
Robert [Ej. 

Acknowledgements. The author is grateful to Philippe Flajolet for uncountable 
interesting conversations on tree algorithms and Mellin transforms. The paper also 
benefited from two anonymous referees' comments. 

2. A Binary Splitting algorithm 

This section analyzes an algorithm investigated by Knuth in 1973, see Knuth [TB] 
page 131-132. It consists in splitting randomly and recursively a group of n initial 
items into subgroups until each of the subgroups has cardinality or 1. Latter, it 
has been used by Capetanakis [3] and Tsybakov and Mikhailov in the design 
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of algorithms for access protocols to communication channels. See also the surveys 
Flajolet and Jacquet ^H] and Ephremides and Hajek |7j. 

Binary Splitting Algorithm JC(n) 
— n = or 1 — ► Stop. 

- n > 2. The n elements are randomly, equally, divided into two subgroups. 
If rii and n — rii denote the cardinalities of these subgroups then 

— ► Apply K,(m) and K(n - ni). 

The cost R n of the algorithm IC(n) starting with n elements is defined as the 
number of steps it requires to stop, in particular, Rq = Ri = 1. The following 
proposition establishes a recurrence relation for the other values of n. It is simply 
a formal rephrasing of the description of the algorithm. 

Proposition 1 (Stochastic Recurrence Relation). For n > 2, the random variable 
R n satisfies the relation 

R n dl = l + Rl_ Sn +R 2 Sn , n > 2, 

with S n = B\ H h B n and 

- the i.i.d. variables B i; i > 1 are Bernoulli with parameter 1/2; 

- for < k < n, the variables R\ and R\_ k are independent and R\ and R^ 
have the same distribution as R k . 

The Bernoulli variables have the following interpretation: For i > 1, if B^ = 
[resp. Bi = 1 ] the ith element goes in the first [resp. second] subgroup. 

The recurrence relation for (R n ) and the boundary conditions at n = and 1 
can be integrated in the following way, 

(5) R n d =' 1 + Rn-S n + Rs n ~ 2{n<l}- 

The rest of the section is devoted to the analysis of the asymptotic behavior of the 
average value E(i? n ) of the random variable R n when n is large. 

Proposition 2 (Poisson Transform). For x > 0, the Poisson transform r(x) of the 
sequence (E(i?„)) is given by the series 

(6) r{x) = E ( R n)^y e- x = 1 + 2 £ 2 fe P (t 2 < x/2 k ) , 

n=0 ' k>0 

where t% is the sum of two independent exponentially distributed random variables 
with parameter 1 . 

Proof. If N = (N([0,t])), (t n ) is a Poisson process with intensity 1. See the appen- 
dix. The Poisson transform r(x) can be expressed as 

r(x) = E (R N ([ ,x])) , 

For n > 0, S n denotes the nth partial sum of Bernoulli random variables with 
parameter 1/2. Relation © gives the identity 

Rn([0,x]) ^ ' 1 + R Sn {1 0M) + R N([0,x])SNao,*]) ~ 2 {N([0,x])<l}- 

The Poisson variable random variable AT([0, x]) is split into two random variables 
Sn([o,x\) an d -^([Oj x\) — Sn([o. x ])- According to PropositionllUlof the appendix, the 
distribution of these two (independent) random variables is Poisson with parameter 
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x/2, i.e. the distribution of N([0,x/2]). Since, by definition, t 2 is the second point 
of the Poisson process then {iV([0, x]) < 1} = {t 2 > x} and 

E (RNW,x])) = 2E (i?iV([0,x/2])) + 1 - 2P (t 2 > x) . 

If f(x) — r(x) — 1, the last identity can be written as f(x) = 2f(x/2) + 2P (t 2 < x). 
By iterating, one gets the expansion 

n-l 

/(x) = 2 n f(x/2 n ) + 2 2 fe P(t 2 < x/2 fc ). 

The limit of the sequence (2 n f(x/2 n )/x) is the limit of (r(ft) — l)/h as ft, goes to 0: 
lim 2 n f(x/2 n )/x = /(O) = E(i? ) - E(i?i) = 0. 

n — >+oc 

Representation JBJ of the Poisson transform is thus established. □ 

The following proposition establishes a useful integral representation of the quan- 
tity E(i2 n ) which gives the key of the asymptotic expansion. Its proof uses a prob- 
abilistic trick of independent interest to invert the Poisson transform. 

Proposition 3 (Probabilistic de-Poissonnization). For n > 2, 

(7) E{R n ) =4n [ 2- { - lo ^ {x)} {n - 1) (1 - x) n ~ 2 dx - 1, 

Jo 

with {y} = y — [_y\ , the fractional part of y G R. 

Proof. As in the proof of Proposition El Af = (N([0,x])) = (t n ) denotes a Poisson 
point process with intensity 1. By using Equation @ and by decomposing according 
to the number of t n 's in the interval [0,ir], one gets 

r{x) = l + 2^2 fe P(t 2 <x/2 k ) 

k>0 

= 1 + 2 J2 2k J2 P (* 2 - z/2 fc ,iV([G, %]) = n) 

k>Q n>2 

= 1 + 2 J2 2 fc p ( i2 - a; / 2fc 1 ^(t ' *]) = n ) — e ~ x - 

k>0 n>2 ' 

According to Proposition^] conditionally on the event {N([0,x]) = n}, the vari- 
able (ti, t 2 , ■ ■ ■ t n ) has the same distribution as (xUJ\y xUjly xU?^), where 
(Ui,l < i < n) are independent random variables uniformly distributed on [0,1] 
and (U^y 1 < i < n) denotes their non-decreasing reordering. In particular, condi- 
tionally on the event {N([0, x]) = n}, the variable t 2 has the same distribution as 
xU™2) j hence the quantity 

P (t 2 < x/2 k I N([0, x])=n)=F (xU 2 , n < x/2 k ) = P (U 2 , n < l/2 k ) 
does not depend on x. The Poisson transform can thus be written as 

(8) r(x) = 1 + 2 2* £ P ^ 1 /2' £ ) ^ e- 

fe>0 n>2 
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Now, by switching the two sums, one gets 

- E J + 2 E ( E 2* P < V2*) ] ^ 

n>0 ' n>2 \k>0 



- x + xe- x + J2 l + 2^2 fc P(C/ 2 ,„ < l/2 k ) ^-e 

n>2 \ fc>0 ' "' 



The last expression is clearly a Poisson transform, by identifying the coefficients, 
this yields the following formula, for n > 2, 

r„ = 1 + 2 2" P f < 4) - 1 + 2 £ 2 k E ( V,„< 1/2n ) . 



2fc 

fe>0 ' ' k>0 

By Fubini's Theorem, the sum and the expectation can also be switched (the sum- 
mands are non-negative), therefore, 

r n = l + 2E fc2 fe l {fe <_ log2([/2i „ )} j =l + 2(E(2r- lo ^^")l) - l) . 

Since, for x > 0, P(U~2, n < x) = 1 — (1 — a;)" — nx(l — a?)™ -1 , the density function 
of the variable [7 2 „ is the function a; — * n(n — l)x(l — x) n ~ 2 on [0, 1]. This gives 
the relation 

r„ = 4n / 2t- lo ^ x ^{n-l)x{l-x) n - 2 dx-l 



= 4n / 2-{- los ^ :r )>(7i-l)(l-a;) n - 2 dT-l. 
./o 

The proposition is proved. □ 

With a similar use of Fubini's Theorem as in the above proof, one gets the 
following representation of the Poisson transform of (i? n ). 

Proposition 4. The Poisson transform of the sequence (R n ) can be represented as 
(9) r(x)=4se f 2-{ log ^)- lo s^» e~ y dy + 2(1 + x) e~ x - 1, z > 0. 



If Equation J3J is more explicit than Equation (JSJ , it has nevertheless no use in the 
asymptotic analysis, Equation © is the key identity. 

Proof. By Equation (jBJ) and Fubini's Theorem, one gets 



i(r(x) - 1) = ^ 2* E (l {ta < B/2 * } ) = E ]T 2 fc 1 

fc>0 \k>0 



{t 2 <x/2 k } 



E[(2L^(^)J+i-i) l {ta ^ } ], 



the proof is concluded by trite calculations and by using the fact that the random 
variable t% has the density (a:exp(— a?)) on K + . □ 

Knuth's result on the asymptotic behavior of this algorithm is given by the 
following theorem. 
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Theorem 5 (Asymptotic oscillations). The average of R n satisfies the expansion 

(10) E(R n ) = nF(\og 2 (n))-l + 0(ne~ n ), n > I, 
with 

r+oo 

(11) F{y) = A 2- {y - los ^ x)} e- x dx. 

Jo 

The function F is periodic with period 1, in particular the sequence (E(R n )/n) is 
not converging. 

In communication networks, the quantity E(R n )/n is related to an average trans- 
mission time when n messages are initially in the network. The practical conse- 
quence of the oscillations of the sequence (E(i?„)/n) is nevertheless quite limited 
since the amplitude of the oscillations of the function F is of the order of 10 -5 . See 
Figure ^ 



n-i 

dx. 



Proof. By Equation 0), it is enough to look at the asymptotic behavior of 

2 -{- log 2 (x)} n (1 _ x y-l dx = J 2 -{log 2 (n)-log 2 ( a; )} ^ _ £ j 

The elementary inequality 

p + CO 

2 -{- \o E2 {x)} n ^ _ x y^l dx _ 2 -{\o &2 (n)-log 2 (x)} e -x dx 

n JO 



: / e~ x dx 







('-;) 



n-1 

) 

II / 



dx = 2e 



concludes the proof. □ 

Remark. By evaluating the integral of Equation Q), one gets that the quantity 
E(i?„) can be expressed as 



E(^) = l + 2^2 fc (l-(l-i) n 

k>0 V ^ ' 



n I 1 



2 fe I 1 2 k 



This is the starting point of most of analyses of this algorithm. It is followed by some 
exponential approximations, a Mellin transform of the residual series and finally 
some complex analysis arguments to derive the asymptotic behavior of the original 
sequence. The periodic function F appears then through its Fourier transform. 
By inversion, it is expressed in term of the values of the Gamma function T on a 
vertical axis of the complex plane, 

F{x) = -\h £ 6r(&-i)e 2 ^ 

fcez-{0} 

with £ fe = 2i£;7r/log2 for k € Z. 

Clearly, it is much easier to evaluate the asymptotic behavior of the integral 10. 
Moreover, as a benefit, the periodic function F shows up quite naturally and with 
a direct explicit expression which is apparently new. 
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Figure 1. The interpolated sequence n — > E(i?„)/n 
3. Ethernet 

The context of the algorithm analyzed in this section is the following: In a com- 
munication network with only one channel, at most one message can be transmitted 
each unit of time. If two transmitters (or stations) try at the same time, this is 
a failure (a collision) and both of them have to retransmit later. In an Ethernet 
network, to each station waiting for transmission is associated an integer L which 
is the number of collisions it has already experienced. At time t + 1, a station 
with a counter equal to k tries to transmit with probability l/2 k . See Metcalf and 
Boggs ^H] and Aldous £Q. For t > 0, L(t) denotes the value of the counter of a 
given station which is waiting for transmission for t units of time. In a completely 
congested network, the sequence (L(t)) evolves as follows: L(0) = 1 and 



L(t+1) 




with probability 1 — i 
1 a L ^ 



,L(t) 



where a S (0,1). For Ethernet the value of a is 1/2. An approximated counting 
algorithm proposed by Flajolet and Martin JI] uses also such a sequence (L(t)), 
see Flajolet [B]. 

For k > 1, Gk denotes the sojourn time of (L(i)) in state k, Gk is a geometrically 
distributed with parameter a k : 

nG k >n) = (l-a k ) n . 

Consequently, G\ + ■ ■ ■ + Gk-i is the hitting time of k for the process (L(t)). Note 
that the random variables (Gk) are, of course, independent. 

The average hitting time of k is therefore a~ k , this suggests that the value of 
L(t) is of the order of log 1 / a (t) when t is large. This approximation is indeed 
true : it is not difficult to show that the quantity \E(L(t)) — log 1 / a (t)| is bounded 
with respect to t. This estimation suggests that there should be a convergence in 
distribution of the random variable L(t)— log 1 / a (t) as t goes to infinity. Surprisingly, 
this convergence does not hold. With calculus on some alternating series, Mellin 
transforms and complex analysis methods, Flajolet 8 has shown that the variable 
L(t) — \og 1 f a (t) exhibits an asymptotic oscillating behavior with respect to t. For 
an extension of these results, see also Kirschenhofer et al. ^3 • 
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In this section, a simple proof of this result is presented, together with an explicit 
description of the asymptotic behavior of the distribution of the random variable 
L(t) — logi / a (t). The following elementary proposition simplifies a lot the analysis 
of the algorithm. 

Proposition 6. The convergence in distribution 

n—l +00 

(12) a n J2 Gn-k^H d ^ J2 * k Ek 

fe=0 fc=0 

holds as n goes to infinity, where (E n ) is a sequence of independent random variables 
exponentially distributed with parameter 1. The random variable H has a density 
h on K + given by 

+00 

(13) h(x) = — — V =s— ; T-a-' n exp (-a" n a;) , x > 0. 

Moreover, Convergence (|12|) is true for the total variation norm, i.e. 



(14) lim sup 

X>Q 



J2 ""Gn-k > X - P {H > X) 



\k=Q 



= 0. 



The random variable H is already known in the domain of communication net- 
works, but in a very different framework: The congestion avoidance phase of the 
Transmission Control Protocol (TCP) of the Internet. In this case, H is related 
to the stationary distribution of the throughput of a long TCP connection. See 
Dumas et al. |S] . The variable H also appeared in mathematical finance models to 
describe Asian Options, see Carmona et al. 0]. 

Proof. For fc > and x > 0, then 

P(a n G n -k >x) = (l- a "- fe )^/°"l 

= exp {\x/a n } log (1 - a"- fe )) ~ exp ( - x/a k ^j , 

hence the random variable a n G n -k converges in distribution to a k Ek where Ek 
is exponentially distributed with parameter 1. Therefore, by independence of the 
variables (Gk), the convergence (|12|) holds. For the explicit expression of the density 
of H, see the proof of Proposition 13 of Dumas et al. 0. 
Chebychev's Inequality gives, for x > 0, 




x \ f— ' / x I 



hence, the uniform convergence l|14H has only to be proved in a compact interval. 
Since the sequence n — > P (a n G n -k > x) is non-decreasing, the same property holds 
for the function /„ defined as follows 



/ n-l N 

/„(i) d ^Pfl"^G„_ t >x 

V k=Q J 



By Dini's Theorem, the non-decreasing sequence of functions (/ n ) converges uni- 
formly on compact sets to x — > V(H > x). The uniform convergence Ijl4(l is estab- 
lished . □ 
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Figure 2. The density function of H when a = 1/2 



For t > 1 and it 6 [0, 1], it is easy to check that 

e (>(*>) = e f « n (i - «) i{i(t)<»} ) = E - u ) p ( l w ^ n ) 

\n>0 / n>0 

holds and, since P(L(i) < n) = P (Gi H h G n > i), for n > 1. One thus gets 

the relation 

E [exp (-AZ(i))] = E e_A " C 1 ~ e_A ) P ( a " E G «-* ^ > A > °- 

n>0 V i=0 / 

The uniform convergence Ijl4|) gives the expansion, as t goes to infinity, 
E [exp (-AL(t))] = J2 e ~ Xn i 1 - e_A ) ^(H> a n t) + o(l), 



n>0 



by switching the sum and the expectation, one gets 

E[exp(-AL(t))] • ( E e- An (l - e ~ A ) l {H > a ~ t} 



=E| E 

v «>rio gl/a (t/ff)i 

E ' 



(l-e- A )l {t > H} 
E 



(-AlTog 1/o (t/ff)l 



exp ( -\{\og 1/a (t/H)]) l{t> H ) ~K ' \!» 
The following equivalence of Laplace transforms has therefore been obtained: 

E (exp [-A (L(t) - Iog 1/O (t))]) ~ E (exp [-A (jlogy a (t/H)] - log 1/o (*))]) 

= E (exp [-A (l - log 1/B (*0 - {log 1/a (Vff)})]) • 

The above relation implies that if x > and Z(t) = L(t) — log x / a (t) then the 
sequence (Z(a~ n ~ x )) converges in distribution. The main result of Flajolet [Hj can 
now be stated. 
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Theorem 7 (Asymptotic Oscillating Distribution). Asymptotically, the distribu- 
tion of L(t) — log]y a (f) is equivalent to the distribution of F(\og 1 / a (t)) where, for 
x > 0, F(x) is defined as 

F(x) = log 1/a (J-^ -{x- Iog 1/a (H)} , 

where H is the random variable introduced in Proposition^ 

Note that the random function x — > F(x) is periodic with period 1. The variable 
H is in fact concentrated around its average away from (see the detailed study 
by Litvak and den Zwet and from +oo (exponential decay). The phenomenon 
of moderate oscillations (but not as small) seen for Knuth's Algorithm is thus also 
true in this case. 



4. Extensions 

In Section[21 the asymptotic behavior of the sequence (E(i?„) is directly obtained 
from Relation Q. To obtain this identity, the key steps are 1) the de-Poissonization 
and 2) the use of Fubini's Theorem in Equation © to remove the series. In Sec- 
tion [3 the key step is also the use of Fubini's Theorem in Equation (|15fl to get rid 
of the series. 

It may be thought that these derivations are nevertheless possible only because of 
the particular expressions involved: For example, in Sectional the special properties 
of Poisson processes are critical in the solution of the problem. The purpose of this 
section is to propose a simple method along the same lines to study the asymptotic 
behavior of some series and functions. As it will be seen, it applies in various sit- 
uation, even when there is no probabilistic interpretation of the sequence/function 
under study. 

Dyadic Sums. This example is taken from Flajolet et al. jH] page 35. The behavior 
of the function 

(16) G(x)=J29(x/2 k ) 

k>l 

is investigated when x goes to infinity. 

It is assumed that x — > g(x) is differentiable and that g(0) = so that the sum 
is well defined. These conditions are not the weakest possible. The goal here is to 
keep the presentation as simple as possible, not to get the most accurate result for 
this special case. 

By using the elementary relation, for x > 0, 



g( x ) = / g'{u)du, 

Jo 

the series (|16|) can be written as 

G ( x )=^2 1 {u<x/2"}9'(u)du. 
k>i Ja 

When the sum and the integral are permuted in the last expression, it yields 

f-X f-X 

\ ^2 1 {u<x/2"}g'(u)du= [\og 2 (x /u)\g'(u)du, 
Jo k>1 Jo 
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Fubini's Theorem states that this last term is indeed G(x) when the condition 

log 2 (-")| \g'(u)\ du < +00 
holds. In this case, the function G can thus be represented as 

G(x)=[ [\og 2 (x / u) \ g'(u)du 

pX 

{log 2 (z / '«)} g'(u)du+ I log 2 (x / u) g'(u)du. 
Jo 

The following proposition has been proved. 

Proposition 8. If the function g is differentiate and such that g(0) = 0, under 
the condition 

p + CC 

(17) / \\og{u)\\g'{u)\du<+™, 



the dyadic sum G(x) can be expressed as 

r +oc 1 f +oc a(u) 

(18) G(x) = F(log 2 (x)) + / {\ og2 {x/u)}g\u)du+— / du . 

Jx log 2 J x u 

where F is the periodic function, with period 1, defined by 

-1 /-+oo / \ p-\-oo 

F(v) = -r-Z ^du- {y-\og 2 {u)}g'{u)d Ul y>0. 

log 2 J u J 

Provided that g has a monotone behavior in the neighborhood of and +00, Con- 
dition l|17J) is equivalent to the fact that the integral 

+Q °^du 
u 

exists, i.e. that the Mellin transform of g is defined at 0. 

From Equation l|18|) , it is then not difficult to derive an asymptotic expansion of 
G{x) as x goes to infinity. 

Harmonic Sums. More generally, the simple method developed above can be 
used for a more general class of functions: 

k>l 

This is the main application of Flajolet et al. J3- For simplicity, it is assumed that 
the sequence (pk) is non-increasing and converging to for example. Provided that 
Fubini's Theorem can be applied, one gets the following representation: 

r +00 



r+00 

G ( x ) = I ^2^k^{f, k >u/x} g'(u)du. 

J ° k>l 

If, for y > and n > 1, 

n 

r(y) = sup{fc : fi k > y} and A(n) = A fc , 



k=l 



ON THE ASYMPTOTIC BEHAVIOR OF SOME ALGORITHMS 



13 



with the convention that sup{0} = 0, then the harmonic sum can be written as 

r + co 

G(x) = / A(t(x/u)) g'(u) du. 
Jo 

Hence, the rates of growth of A and r give the key of the asymptotic behavior of 
the function G. 

Final Remark. It must be noted that, when it can be applied, the method pro- 
posed in this paper will require weaker assumptions than the analytic approach. 
Indeed, the conditions to apply Fubini's Theorem are quite minimal (which does 
not mean that there is no condition at all). On the contrary, the use of Mcllin 
transform implies the existence of the transform itself on some specified strip of the 
complex plane together with some growth conditions at infinity. 

5. Appendix on Poisson Processes 

To keep the paper self-contained, this section recalls the basic definitions and 
results concerning Poisson processes used in this paper. See also Kingman ^31 
and Chapter 1 of Robert [2] for a more detailed presentation of these important 
stochastic processes. 

Definition 9. A random variable X has the Poisson distribution with parameter 
X whenever 

A™ 

¥(X = n) = — e~ x , neN. 

The following proposition is the basic property that motivates the use of Pois- 
son transform when dealing with splitting algorithms. It is a striking elementary 
property of the Poisson distribution. 

Proposition 10 (Splitting Property). If X is a Poisson random variable with 
parameter A, X balls are thrown randomly among n urns and for 1 < i < n, Xi 
denotes the number of balls in the ith urn. The variables (Xi) are independent with 
a common Poisson distribution with parameter X/n. 

Definition 11. A Poisson process with intensity A is an increasing sequence (t n ) 
of positive random variables such that 

— the increments t n+ \ — t n , n > 1, are independent; 

- For n > 1 and x > 0, P(i„+i — t n > x) = exp(— Ax). 

A Poisson process (t n ) is also represented as a non-decreasing integer valued 
function (iV([0, t]),t> 0) on R+ where, for t > 0, N([0, t]) is the number of t n 's in 
the interval [0, t}: N([0, t]) — n on the event {t n < t < t n +i}- For a Poisson process, 
the representations as a sequence (t n ) or a non-decreasing function (iV([0, t]),t > 0) 
are or course equivalent. 

Proposition 12. It (t n ) is a Poisson process with intensity A, for t > 0, 

- the variable N([0,t]) has a Poisson distribution with parameter X; 

— conditionally on the event {N([0,t\) — n}, the variables 

o < h < t 2 < ■ ■ ■ < t n < t 

have the same distribution as the reordering of n independent, uniformly 
distributed random variables on [0,t]. 
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